Bacillus tusciae Bonjour & Aragno 1994 is a hydrogen-oxidizing, thermoacidophilic spore
former that lives as a facultative chemolithoautotroph in solfataras. Although 16S rRNA gene
sequencing was well established at the time of the initial description of the organism, 16S
sequence data were not available and the strain was placed into the genus Bacillus based on
limited chemotaxonomic information. Despite the now obvious misplacement of strain T2T as
a member of the genus Bacillus in 16S rRNA-based phylogenetic trees, the misclassification
remained uncorrected for many years, most likely because the extremely difficult cultivation
conditions and poor growth rate of the strain hampered its analysis. Here we provide a
taxonomic re-evaluation of strain T2T (= DSM 2912 = NBRC 15312) and propose its
reclassification as the type strain of a new species, Kyrpidia tusciae, and the type species of the new
genus Kyrpidia, which is a sister-group of Alicyclobacillus. The family Alicyclobacillaceae da
Costa and Rainey, 2010 is emended. The 3,384,766 bp genome with its 3,323 protein-coding
and 78 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.



Introduction 



Strain T2T (= DSM 2912 = NBRC 15312) is the type strain of Bacillus tusciae [1]. Both
strain T2T and strain T201 were isolated from ponds in the solfatara of San Federigo, a
geothermal area near Lago, Tuscany, Italy [1, Table 1], as indicated in the Latin species
epithet tusciae, meaning from Tuscia [1]. In the original study of Bonjour and Aragno [1],
DNA:DNA reassociation studies of strain T2T, Bacillus schlegelii (also autotrophic) and
the thermoacidophilic Bacillus acidocaldarius (later reclassified as Alicyclobacillus
acidocaldarius) from another hot spring were carried out. Because of the low levels of
hybridization observed, the authors proposed that strains T2T and T201 formed a distinct
taxonomic unit [1]. However, only limited chemotaxonomic data were available at the time
and 16S rRNA sequence data were not available, which led the authors to place the species
into the genus Bacillus [1]. The rather difficult cultivation conditions and the poor
growth rate of strain T2T likely delayed its reclassification, although Rainey et al.
had previously noted the link to the genus Alicyclobacillus [19]. This link was recently
corroborated by the observation that strain T2T as well as A. acidocaldarius lack the
sspE gene for acid-soluble spore proteins frequently found in members of the Bacillales
[20]. Here we present a summary classification and a set of features for B. tusciae
strain T2T, a description of the complete genome sequencing and annotation, and a
proposal to reclassify B. tusciae as a member of the new genus Kyrpidia as Kyrpidia
tusciae comb. nov.

Classification and features 

A representative genomic 16S rRNA sequence of B. tusciae T2T was compared using NCBI
BLAST [21] under default settings (e.g., considering only the high-scoring segment pairs
(HSPs) from the best 250 hits) with the most recent release of the Greengenes database
[22], and the relative frequencies of taxa and keywords (reduced to their stems [23])
were determined, weighted by BLAST scores. The most frequently occurring genera were
Alicyclobacillus (67.8%), Bacillus (18.2%), Thermoactinomyces (6.0%), Paenibacillus
(5.6%) and Exiguobacterium (1.6%) (99 hits in total). Regarding the seven hits to
sequences from members of the species, the average identity within HSPs was 99.6%,
whereas the average coverage by HSPs was 99.5%. Among all other species, the one yielding
the highest score was A. acidiphilus NR_028637, which corresponds to an identity of
92.4% and an HSP coverage of 57.7%. (Note that the Greengenes database uses the INSDC
(= EMBL/NCBI/DDBJ) annotation, which is not an authoritative source for nomenclature or
classification.) The highest-scoring environmental sequence was EU638396 ('Ecological
Role Firmicutes Identified Thermophilic Microbial Fuel Cells thermophilic microbial fuel
cell acetate-fed experiment clone SHBZ1905'), which showed an identity of 99.4% and an
HSP coverage of 90.0%.
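The score-weighted tally of genera described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the pipeline used for this study: the function name `weighted_genus_frequencies`, the `(genus, score)` input format and the example hits are all hypothetical.

```python
from collections import defaultdict


def weighted_genus_frequencies(hits):
    """Return the relative frequency (in percent) of each genus,
    weighting every hit by its BLAST score.

    `hits` is an iterable of (genus, blast_score) pairs; this input
    format is an assumption made for this sketch.
    """
    totals = defaultdict(float)
    for genus, score in hits:
        totals[genus] += score
    grand_total = sum(totals.values())
    return {genus: 100.0 * s / grand_total for genus, s in totals.items()}


# Made-up example hits (not data from this study).
example_hits = [
    ("Alicyclobacillus", 1200.0),
    ("Alicyclobacillus", 800.0),
    ("Bacillus", 500.0),
]
freqs = weighted_genus_frequencies(example_hits)
for genus, pct in sorted(freqs.items(), key=lambda kv: -kv[1]):
    print(f"{genus}: {pct:.1f}%")
```

Weighting by score rather than counting hits means that a few strong matches can outweigh many weak ones, which is why the percentages need not mirror the raw hit counts.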



The most frequently occurring keywords within the labels of all environmental samples
that were hits were 'microbi' (13.3%), 'thermophil' (12.5%), 'cell, fuel' (12.4%),
'ecolog, firmicut, identifi, role' (6.2%) and 'experi' (6.1%) (151 hits in total). The
most frequently occurring keywords within the labels of those environmental samples
that yielded hits of a higher score than the highest-scoring species were 'microbi'
(13.9%), 'cell, fuel, thermophil' (13.1%), 'ecolog, firmicut, identifi, role' (6.5%),
'experi' (6.4%) and 'acetate-f' (4.7%) (124 hits in total). These keywords corroborate
the features of the environment from which strain T2T was isolated.

Figure 1 shows the phylogenetic neighborhood of 
B. tusciae in a 16S rRNA tree. The sequences of the 
five 16S rRNA gene copies in the genome differ 
from each other by up to eight nucleotides, and 
differ by up to six nucleotides from the previously 
published 16S rRNA sequence AB042062. 

B. tusciae cells are straight rods measuring 0.8 × 4-5 μm (Figure 2) with single,
laterally inserted flagella [1]. In young cultures, cells stain Gram-positive and exhibit
oval, subterminal spores that swell the sporangium [1]. Cultures grow at 55°C at a pH
ranging from 4.2 to 4.8 [1]. Although cells grow best under autotrophic conditions via
hydrogen oxidation, they can also grow heterotrophically on alcohols, amino acids and
short chain fatty acids, but not on sugars [1]. Substrate usage was described in detail
by Bonjour and Aragno [1]. Cells contain a soluble malate-dehydrogenase activity, which
cannot reduce pyridine (NAD+/NAD(P)+) [1], nor was NADH oxidation observed [1].
Hydrogenase activity was reported as being inducible [1]. An operational Calvin cycle
was reported based on the presence of ribulose-1,5-bisphosphate carboxylase activity in
autotrophically-grown cells [1]. Autotrophically grown cells show inclusions of
poly-β-hydroxybutyric acid [1].

Chemotaxonomy 

The structure of the cell wall of strain T2T was reported as typically Gram-positive,
without a protein coat [1]. ω-Alicyclic fatty acids are lacking and menaquinone MK-7 was
identified as the main quinone (personal communication Brian J. Tindall).

Table 1. Classification and general features of Kyrpidia tusciae strain T2T according to the MIGS
recommendations [2] and the NamesforLife database [3].

| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| | Current classification | Domain Bacteria | TAS [4] |
| | | Phylum Firmicutes | TAS [5-7] |
| | | Class Bacilli | TAS [8,9] |
| | | Order Bacillales | TAS [10,11] |
| | | Family Bacillaceae | TAS [10,12] |
| | | Genus Bacillus | TAS [10,13,14] |
| | | Species Bacillus tusciae | TAS [1,15] |
| | | Type strain T2 | TAS [1] |
| | Revised classification | Family Alicyclobacillaceae | TAS [9,16] |
| | | Genus Kyrpidia | NAS |
| | | Species Kyrpidia tusciae | NAS |
| | Gram stain | positive | TAS [1] |
| | Cell shape | straight rods | TAS [1] |
| | Motility | not reported, but lateral flagella visible | TAS [1] |
| | Sporulation | sporulating | TAS [1] |
| | Temperature range | thermophile, grows > 47°C, < 65°C | TAS [1] |
| | Optimum temperature | 55°C | TAS [1] |
| | Salinity | not reported | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [1] |
| | Carbon source | short chain fatty acids, amino acids and alcohols | TAS [1] |
| | Energy metabolism | facultatively chemolithoautotroph | TAS [1] |
| MIGS-6 | Habitat | hot, acidic solfatara fields | TAS [1] |
| MIGS-15 | Biotic relationship | free living | TAS [17] |
| MIGS-14 | Pathogenicity | none | NAS |
| | Biosafety level | 1 | TAS [1] |
| | Isolation | ponds in solfatara | TAS [1] |
| MIGS-4 | Geographic location | San Frederigo, near Lago, Tuscany (Italy) | TAS [1] |
| MIGS-5 | Sample collection time | about or before 1984 | TAS [1] |
| MIGS-4.1 | Latitude | 43.33 | NAS |
| MIGS-4.2 | Longitude | 10.50 | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | about 155 m | NAS |

Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS:
Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a
generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene
Ontology project [18].



Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the basis of its phylogenetic position [34],
and is part of the Genomic Encyclopedia of Bacteria and Archaea project [35]. The genome
project is deposited in the Genomes OnLine Database [31] and the complete genome sequence
in GenBank. Sequencing, finishing and annotation were performed by the DOE Joint Genome
Institute (JGI). A summary of the project information is shown in Table 2.



Growth conditions and DNA isolation 

B. tusciae strain T2T, DSM 2912, was grown in DSMZ medium 369 (Bacillus tusciae medium)
[36] at 50°C. DNA was isolated from 1-1.5 g of cell paste using the Qiagen Genomic 500
DNA Kit (Qiagen, Hilden, Germany) with lysis modification st/LALMP according to Wu et
al. [35]. DNA is available through the DNA Bank Network [37].

[Figure 1: 16S rRNA phylogenetic tree; image not reproduced]

Figure 1. Phylogenetic tree highlighting the position of B. tusciae relative to the type strains within the families
Alicyclobacillaceae and Thermoactinomycetaceae, which gave the best hits when conducting a BLAST search
[22] against the sequences contained in the last release from the All-Species-Living-Tree Project [24], and the
type strains of all other families within the order Bacillales. The tree was inferred from 1,403 aligned characters
[25,26] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [27]. Rooting was done
initially using the midpoint method [28] and then checked for its agreement with the current classification
(Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to
the branches are support values from 450 ML bootstrap replicates [29] (left) and from 1,000 maximum
parsimony (MP) bootstrap replicates [30] (right) if larger than 60%. Lineages with type strain genome sequencing
projects registered in GOLD [31] are labeled with one asterisk, those also listed as 'Complete and Published'
with two asterisks [32,33].

Figure 2. Scanning electron micrograph of B. tusciae strain T2T



Table 2. Genome sequencing project information

| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Two genomic libraries: Sanger 8 kb pMCL200 and 454 standard library |
| MIGS-29 | Sequencing platforms | ABI3730, 454 GS FLX |
| MIGS-31.2 | Sequencing coverage | 5.5 × Sanger; 25.2 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.0.0-PreRelease-07/15/2008, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| | Genbank ID | CP002017 |
| | Genbank Date of Release | April 5, 2010 |
| | GOLD ID | Gc01268 |
| | NCBI project ID | 31345 |
| | Database: IMG-GEBA | 646564511 |
| MIGS-13 | Source material identifier | DSM 2912 |
| | Project relevance | Tree of Life, GEBA |



Genome sequencing and assembly 

The genome was sequenced using a combination of Sanger and 454 sequencing platforms. All
general aspects of library construction and sequencing can be found at the JGI website
[38]. Pyrosequencing reads were assembled using the Newbler assembler (Roche). Large
Newbler contigs were broken into 3,650 overlapping fragments of 1,000 bp and entered
into assembly as pseudo-reads. The sequences were assigned quality scores based on
Newbler consensus q-scores with modifications to account for overlap redundancy and
adjust inflated q-scores. A hybrid 454/Sanger assembly was made using the phrap
assembler [39]. Possible mis-assemblies were corrected with Dupfinisher and gaps between
contigs were closed by editing in Consed, by custom primer walks from sub-clones or PCR
products [40]. A total of 549 Sanger finishing reads were needed to close gaps, to
resolve repetitive regions, and to raise the quality of the finished sequence. The error
rate of the completed genome sequence is less than 1 in 100,000. Together, the
combination of the Sanger and 454 sequencing platforms provided 30.7 × coverage of the
genome. The final assembly contains 18,870 Sanger reads and 413,112 pyrosequencing
reads.
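The step of breaking large contigs into overlapping 1,000 bp pseudo-reads can be illustrated with a short sketch. The text does not state the overlap between fragments, so the `step` of 900 bp (a 100 bp overlap) is purely an assumption, as are the function name `shred_contig` and the toy contig; this is not the JGI implementation.

```python
def shred_contig(seq, frag_len=1000, step=900):
    """Split a contig into overlapping fragments ("pseudo-reads").

    frag_len follows the 1,000 bp fragment size given in the text;
    step (and therefore the overlap) is an assumed value.
    """
    if len(seq) <= frag_len:
        return [seq]
    starts = list(range(0, len(seq) - frag_len + 1, step))
    # Make sure the last fragment ends exactly at the end of the contig.
    if starts[-1] != len(seq) - frag_len:
        starts.append(len(seq) - frag_len)
    return [seq[s:s + frag_len] for s in starts]


# Toy 2,800 bp contig (not real sequence data).
contig = "ACGT" * 700
pseudo_reads = shred_contig(contig)
print(len(pseudo_reads), "pseudo-reads of", len(pseudo_reads[0]), "bp")

# The combined coverage quoted in the text is the sum of the two platforms.
combined_coverage = round(5.5 + 25.2, 1)
print("combined coverage:", combined_coverage)
```

Overlapping fragments let the hybrid assembler treat the 454 consensus like ordinary long reads while still being able to re-join neighbouring fragments through their shared ends.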

Genome annotation 

Genes were identified using Prodigal [41] as part of the Oak Ridge National Laboratory
genome annotation pipeline, followed by a round of manual curation using the JGI
GenePRIMP pipeline [42]. The predicted CDSs were translated and used to search the
National Center for Biotechnology Information (NCBI) non-redundant database, UniProt,
TIGR-Fam, Pfam, PRIAM, KEGG, COG, and InterPro databases. Additional gene prediction
analysis and functional annotation were performed within the Integrated Microbial
Genomes - Expert Review platform [43].

Genome properties 

The genome consists of a 3,384,766 bp long circular chromosome with a G+C content of
59.1% (Table 3 and Figure 3). Of the 3,401 genes predicted, 3,323 were protein-coding
genes and 78 were RNA genes; 173 pseudogenes were also identified. The majority of the
protein-coding genes (70.7%) were assigned a putative function, while the remaining ones
were annotated as hypothetical proteins. The distribution of genes into COG functional
categories is presented in Table 4.
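As a quick consistency check, the percentages in Table 3 can be recomputed from the raw counts. The sketch below is illustrative only; the dictionary layout and the helper `pct` are assumptions, and only rows whose denominator is unambiguous (genome size for base-pair rows, total genes for gene rows) are included.

```python
GENOME_SIZE_BP = 3_384_766
TOTAL_GENES = 3_401


def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# (count, denominator, percentage as printed in Table 3)
rows = {
    "DNA coding region (bp)": (2_896_588, GENOME_SIZE_BP, 85.58),
    "DNA G+C content (bp)": (2_000_875, GENOME_SIZE_BP, 59.11),
    "RNA genes": (78, TOTAL_GENES, 2.29),
    "Protein-coding genes": (3_323, TOTAL_GENES, 97.71),
    "Pseudo genes": (173, TOTAL_GENES, 5.09),
    "Genes with function prediction": (2_404, TOTAL_GENES, 70.69),
    "Genes in paralog clusters": (718, TOTAL_GENES, 21.11),
    "Genes assigned Pfam domains": (2_657, TOTAL_GENES, 78.12),
    "Genes with signal peptides": (530, TOTAL_GENES, 15.58),
    "Genes with transmembrane helices": (728, TOTAL_GENES, 21.41),
}

mismatches = [name for name, (count, whole, printed) in rows.items()
              if pct(count, whole) != printed]
print("rows that do not reproduce:", mismatches)
```

Protein-coding plus RNA genes (3,323 + 78) add up to the 3,401 total genes, so the two corresponding percentages sum to 100%.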




Figure 3. Graphical circular map of the chromosome. From outside to the center: Genes on forward strand (color by
COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other
RNAs black), GC content, GC skew.

Table 3. Genome Statistics

| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 3,384,766 | 100.00% |
| DNA coding region (bp) | 2,896,588 | 85.58% |
| DNA G+C content (bp) | 2,000,875 | 59.11% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 3,401 | 100.00% |
| RNA genes | 78 | 2.29% |
| rRNA operons | 5 | |
| Protein-coding genes | 3,323 | 97.71% |
| Pseudo genes | 173 | 5.09% |
| Genes with function prediction | 2,404 | 70.69% |
| Genes in paralog clusters | 718 | 21.11% |
| Genes assigned to COGs | 2,456 | 74.21% |
| Genes assigned Pfam domains | 2,657 | 78.12% |
| Genes with signal peptides | 530 | 15.58% |
| Genes with transmembrane helices | 728 | 21.41% |
| CRISPR repeats | 4 | |





Table 4. Number of genes associated with the general COG functional categories

| Code | Value | %age | Description |
|---|---|---|---|
| J | 153 | 5.6 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 198 | 7.2 | Transcription |
| L | 192 | 7.0 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 36 | 1.3 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 44 | 1.6 | Defense mechanisms |
| T | 121 | 4.4 | Signal transduction mechanisms |
| M | 116 | 4.2 | Cell wall/membrane/envelope biogenesis |
| N | 62 | 2.3 | Cell motility |
| Z | 1 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 41 | 1.5 | Intracellular trafficking, secretion and vesicular transport |
| O | 95 | 3.5 | Posttranslational modification, protein turnover, chaperones |
| C | 217 | 7.9 | Energy production and conversion |
| G | 111 | 4.1 | Carbohydrate transport and metabolism |
| E | 255 | 9.3 | Amino acid transport and metabolism |
| F | 68 | 2.5 | Nucleotide transport and metabolism |
| H | 133 | 4.9 | Coenzyme transport and metabolism |
| I | 145 | 5.3 | Lipid transport and metabolism |
| P | 133 | 4.9 | Inorganic ion transport and metabolism |
| Q | 97 | 3.5 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 298 | 10.9 | General function prediction only |
| S | 224 | 8.2 | Function unknown |
| - | 945 | 27.8 | Not in COGs |

Insights into the genome

At the time when the species name B. tusciae was validly published [9], a 16S rRNA gene
sequence of the strain had not been generated, but DNA-DNA hybridizations (DDHs) were
performed with two reference strains, B. schlegelii [44] and B. acidocaldarius [45], to
show that strain T2T represents a novel species within the genus Bacillus [1]. While B.
schlegelii was named by the same senior author as B. tusciae and most probably is as
misplaced in the genus Bacillus as B. tusciae (see below), B. acidocaldarius was
subsequently reclassified as Alicyclobacillus acidocaldarius [46]. In retrospect,
considering 16S rRNA gene sequence identities of 88.8% (for A. acidocaldarius) and 85.7%
(for B. schlegelii), it is not surprising that Bonjour and Aragno observed only 17% and
0% DDH similarity, respectively, for their novel isolate [1]. While the members of the
genus Alicyclobacillus do in fact represent the closest relatives of strain T2T (Figure
1) with 87.5% to 89.0% 16S rRNA sequence identity in EzTaxon [47], the appropriate
taxonomic placement of B. schlegelii seems to be an unresolved question, with no greater
than 89.3% rRNA sequence identity found with other type strains, none of which are
members of the genus Bacillus [47]. Figure 1 shows B. tusciae as a sister group of one
of the two clades that comprise Alicyclobacillus, with A. pohliae more closely related
to Tumebacillus than to the other members of the genus and likely being misclassified
based on the current data. However, while the clade comprising B. tusciae,
Alicyclobacillus and Tumebacillus has significant statistical support (100% under ML and
97% under MP), the clade comprising only Tumebacillus and A. pohliae is unsupported.

As two of the genera selected for Figure 1, Alicyclobacillus and Bacillus, appeared as
polyphyletic in the tree, we conducted both unconstrained heuristic searches for the
best tree under the ML [27] and MP criterion [30] as well as searches constrained for
the monophyly of these two genera, respectively (for details of the data matrix see the
figure caption). The best-known ML tree had a log likelihood of -13,289.73, whereas the
best tree found under the constraint of Alicyclobacillus monophyly had a log likelihood
of -13,297.23 and was not significantly worse in the Shimodaira-Hasegawa test as
implemented in RAxML [27] (α = 0.05). However, when enforcing Bacillus monophyly, the
resulting log likelihood was -13,412.24, significantly worse than the best tree
(α = 0.01). The best-known MP trees had a score of 2,362, whereas the best trees found
under the constraint of Alicyclobacillus monophyly had a score of 2,374 and were not
significantly worse in the Kishino-Hasegawa test as implemented in PAUP* [30]
(α = 0.05). When enforcing Bacillus (Bacillus subtilis and B. tusciae) monophyly, the
resulting score was 2,439, significantly worse than the best tree (p < 0.0001). (See,
e.g., chapter 21 in [48] for an in-depth description of such paired-site tests.)
Accordingly, the current classification of B. tusciae in Bacillus is at odds with the
16S rRNA data and does not reflect the natural relationships based on that gene. In
contrast, the placement of A. pohliae in Alicyclobacillus is not significantly disputed
by the data.
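To make the size of these differences concrete, the cost of each monophyly constraint can be tabulated from the values quoted above. This sketch only reproduces the raw differences in log likelihood and parsimony score; it does not perform the Shimodaira-Hasegawa or Kishino-Hasegawa tests, which require per-site values that are not given here. The variable names are illustrative assumptions.

```python
# Values quoted in the text.
best_ml_lnl = -13289.73
constrained_lnl = {"Alicyclobacillus": -13297.23, "Bacillus": -13412.24}

best_mp_score = 2362
constrained_mp = {"Alicyclobacillus": 2374, "Bacillus": 2439}

# Loss in log likelihood when each genus is forced to be monophyletic.
delta_lnl = {genus: round(best_ml_lnl - lnl, 2)
             for genus, lnl in constrained_lnl.items()}

# Additional parsimony steps required by each constraint.
extra_steps = {genus: score - best_mp_score
               for genus, score in constrained_mp.items()}

for genus in constrained_lnl:
    print(f"{genus}: delta lnL = {delta_lnl[genus]}, "
          f"extra MP steps = {extra_steps[genus]}")
```

Under both criteria, forcing Bacillus monophyly costs far more than forcing Alicyclobacillus monophyly, which matches the direction of the test results reported above.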

Table 5 shows the whole-genome distances between B. tusciae, A. acidocaldarius [32] and
B. subtilis [33] as calculated using the genome-to-genome distance calculator (GGDC)
[49-51]. The lower left triangle shows those distances derived by dividing the total
sequence length not covered by HSPs by total genome length (left) and by dividing the
total number of non-identical base pairs within HSPs by total HSP length (right); the
upper right triangle shows the distance derived by dividing total genome length minus
total number of identical base pairs within HSPs by total genome length. As expected,
those distances relating HSP coverage and number of identical base pairs within HSPs to
total genome length are higher between B. tusciae and B. subtilis than between B.
tusciae and A. acidocaldarius. That the distances relating the number of identical base
pairs to total HSP length behave differently indicates that the genomic similarities
between B. tusciae and B. subtilis are more strongly restricted to more conserved
sequences, a kind of saturation phenomenon [50]. Figure 4 shows an unrooted phylogenetic
network inferred using the Neighbor-Net algorithm from whole-genome distances calculated
with GGDC [49-50]. The grouping of B. tusciae and A. acidocaldarius, as well as the very
tree-like appearance of this part of the network, indicate that genomic data are also in
conflict with the placement of B. tusciae within Bacillus.

Table 5. Genome-to-genome distances as calculated using GGDC [49-51]

| | B. tusciae | B. subtilis subsp. subtilis | A. acidocaldarius |
|---|---|---|---|
| B. tusciae (CP002017 = NC_014098) | 0.0000 | 0.9916 | 0.97030 |
| B. subtilis subsp. subtilis (AL009126 = NC_000964) | 0.9902/0.1452 | 0.0000 | 0.9908 |
| A. acidocaldarius (CP001727-001730 = NC_013205, 07, 08) | 0.9646/0.1629 | 0.9893/0.1366 | 0.0000 |
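The three distance definitions used in Table 5 can be written out as a small function. This is a simplified sketch of the formulas as worded in the text, not the GGDC implementation: how GGDC aggregates lengths over both genomes of a pair is not specified here, so `genome_len`, `hsp_len` and `identical_bp` are assumed to be pair-level totals, and the example numbers are invented.

```python
def ggdc_style_distances(genome_len, hsp_len, identical_bp):
    """Return the three distances described for Table 5.

    genome_len   -- total genome length considered for the pair (assumed)
    hsp_len      -- total length covered by HSPs
    identical_bp -- number of identical base pairs within HSPs
    """
    if not (0 <= identical_bp <= hsp_len <= genome_len) or hsp_len == 0:
        raise ValueError("expected 0 <= identical_bp <= hsp_len <= genome_len, hsp_len > 0")
    return {
        # length not covered by HSPs / total genome length
        "coverage": (genome_len - hsp_len) / genome_len,
        # non-identical base pairs within HSPs / total HSP length
        "identity": (hsp_len - identical_bp) / hsp_len,
        # (genome length - identical base pairs within HSPs) / genome length
        "combined": (genome_len - identical_bp) / genome_len,
    }


# Invented example: 4% of the genome lies in HSPs, 85% of which is identical.
d = ggdc_style_distances(genome_len=1_000_000, hsp_len=40_000, identical_bp=34_000)
print({name: round(value, 4) for name, value in d.items()})
```

Because the identity-based distance is normalized by HSP length rather than genome length, it can stay small even when very little of the genome is covered, which is the saturation effect noted in the text.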



[Figure 4: Neighbor-Net phylogenetic network; image not reproduced]

Figure 4. Unrooted phylogenetic network inferred with the Neighbor-Net algorithm as implemented in SplitsTree
version 4.10 [52] from genome-to-genome distances calculated using GGDC [49-51]. The analysis includes all
completely sequenced Bacillales type strain genomes as registered in GOLD at the time of publication [31]. Here, the
logarithmic version of the distance calculated as the total genome length minus total number of identical base pairs
within HSPs, divided by total genome length (see Table 5), was used as GGDC distance.



[Figure 5: Venn diagram; image not reproduced. Genome labels: B. subtilis (4,176), B. tusciae (3,323),
A. acidocaldarius (3,150)]

Figure 5. Venn diagram depicting the intersections of protein sets between the
target genome and representative genomes of Bacillus and Alicyclobacillus.

Table 6. Typical features of reference taxa

| Feature | Bacillus tusciae T2T DSM 2912 [1] | Genus Alicyclobacillus [53,54] | Genus Bacillus [54] |
|---|---|---|---|
| Cell shape | straight rods, 4-5 μm long | rods, 1.5-6.3 μm long | rods, normally up to 5 μm long |
| Gram stain | positive | positive | positive |
| Oxygen requirement | aerobic, facultative anaerobic, but not with nitrate | aerobic, facultative anaerobic | aerobic, facultative anaerobic, a few species are strictly anaerobic |
| Endospores | subterminal, oval | terminal up to subterminal | ellipsoidal, central |
| sspE genes for acid-soluble spore proteins | missing | missing | frequent |
| Growth temperature | >47°C to <67°C, opt. 55°C | 4-70°C, opt. 35-65°C | 10-60°C |
| pH optimum | 4.2-4.8 | < 4.5 (range 1.5-5) | 5-10 |
| Phenotype | thermoacidophile | thermoacidophiles | some obligate or facultative thermophiles |
| Habitat | ponds in solfatara of geothermal area | acidic geothermal sites (soil and water), fruit juices, ores | often saprophytes in decaying organic matter |
| Flagellation | single lateral flagellum | motility reported for some species | motile or nonmotile; lateral |
| Menaquinones | MK-7 (personal communication Brian J. Tindall) | MK-7 | MK-7 |
| Major membranous lipid components | no ω-alicyclic fatty acids (personal communication Brian J. Tindall) | Large amounts (>80%) of ω-alicyclic fatty acids with six- or seven-carbon rings, such as ω-cyclohexane undecanoic acid-C17:0 and ω-cyclohexane tridecanoic acid-C19:0. Three species do not possess these fatty acids. Some strains are known to form hopanoids. | cellular fatty acids: ai-C[illegible]:0, i-C15:0, ai-C15:0; no ω-alicyclic fatty acids |
| Oxidase/catalase | weak/weak | ±/± | ±/± |
| Inclusion bodies | poly-β-hydroxybutyric acid when grown autotrophically under ammonium starvation | not specified | some species |
| Energy metabolism | grows best under autotrophic conditions, chemolithoautotrophic with H2 and CO2, but also chemoorganoheterotrophic; does not metabolize sugars | chemoorganotrophic or mixotrophic; carbohydrates, organic acids and amino acids can be utilized. Mixotrophic species utilize Fe²⁺ and S⁰ | chemoorganotrophic |

The fraction of shared genes in the genomes of B. tusciae T2T, A. acidocaldarius [32] and
B. subtilis [33] is shown in a Venn diagram (Figure 5). The numbers of pairwise shared
genes were calculated with the phylogenetic profiler function of the IMG-ER platform
[43]. The homologous genes within the genomes were detected with a maximum e-value of
10⁻⁵ and a minimum identity of 30%.
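The homolog detection thresholds translate into a simple filter over pairwise hits, from which the Venn categories for the query genome follow. The sketch below assumes the thresholds are a maximum e-value of 1e-5 and a minimum identity of 30%; the tuple layout of a hit, the function names and the toy data are hypothetical and do not reflect the IMG-ER phylogenetic profiler's actual interface.

```python
MAX_EVALUE = 1e-5     # maximum e-value for a hit to count as a homolog
MIN_IDENTITY = 30.0   # minimum percent identity


def genomes_with_homolog(hits):
    """Map each query gene to the set of genomes holding a qualifying hit.

    `hits` is an iterable of (query_gene, target_genome, evalue, pct_identity)
    tuples; this layout is an assumption made for the sketch.
    """
    found = {}
    for gene, genome, evalue, identity in hits:
        if evalue <= MAX_EVALUE and identity >= MIN_IDENTITY:
            found.setdefault(gene, set()).add(genome)
    return found


def venn_counts(query_genes, found, first, second):
    """Count query genes shared with both, exactly one, or neither genome."""
    counts = {"all_three": 0, "only_" + first: 0, "only_" + second: 0, "unique": 0}
    for gene in query_genes:
        partners = found.get(gene, set())
        if first in partners and second in partners:
            counts["all_three"] += 1
        elif first in partners:
            counts["only_" + first] += 1
        elif second in partners:
            counts["only_" + second] += 1
        else:
            counts["unique"] += 1
    return counts


# Toy data: four query genes compared against two hypothetical genomes A and B.
toy_hits = [
    ("g1", "A", 1e-20, 55.0),
    ("g1", "B", 1e-8, 41.0),
    ("g2", "A", 1e-6, 32.0),
    ("g3", "B", 1e-3, 60.0),   # rejected: e-value too high
    ("g3", "A", 1e-30, 25.0),  # rejected: identity too low
]
counts = venn_counts(["g1", "g2", "g3", "g4"], genomes_with_homolog(toy_hits), "A", "B")
print(counts)
```

Note that this counts genes from the perspective of the query genome only; the pair shared exclusively between the two other genomes would need a separate comparison.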

A total of 1,363 of the genes are shared by all three genomes, with about equal numbers
of genes (295-387) shared between pairs of genomes to the exclusion of the third genome.
Within the 1,278 unique genes of B. tusciae that have no detectable homologues in the
genomes of A. acidocaldarius and B. subtilis (under the sequence similarity thresholds
used for the comparison) are the genes encoding the key enzymes for the Calvin cycle:
ribulose-1,5-bisphosphate carboxylase (Btus_2871 large subunit, Btus_2872 small subunit)
and phosphoribulokinase (Btus_2868). A closer look at the genome of B. tusciae revealed
additional genes coding for enzymes of the Calvin cycle located in the immediate
neighborhood of the ribulose-1,5-bisphosphate carboxylase genes: phosphoglycerate kinase
(Btus_2865), fructose-bisphosphate aldolase (Btus_2867), phosphoribulokinase
(Btus_2868), glyceraldehyde-3-phosphate dehydrogenase (Btus_2869),
fructose-1,6-bisphosphatase (Btus_2870) and ribulose-1,5-bisphosphate carboxylase
(Btus_2871 large subunit, Btus_2872 small subunit), which are probably organized as an
operon. In autotrophically-grown cells of B. tusciae, an active
ribulose-1,5-bisphosphate carboxylase in an operating Calvin cycle was reported [1].

The phylogenetic position of strain T2T as shown in Figures 1 and 4, corroborated by its
primarily autotrophic energy metabolism [1] (Tables 1 and 6) and the lack of sspE genes
for acid-soluble spore proteins [20] (Table 6), indicates a clear need to reclassify B.
tusciae. The joint but distinct phylogenetic position (Figure 1) supports the
establishment of a novel genus rather than the inclusion of B. tusciae in
Alicyclobacillus. As shown above, 16S rRNA data do not indicate the monophyly of the
remaining Alicyclobacillus spp., but are not in significant conflict with it. The branch
leading to B. tusciae is rather long, indicating a considerable degree of independent
evolution (relative to the other taxa included in the tree, which include the closest
relatives of B. tusciae in the current LTP release, see above). Whether the lack of
ω-alicyclic fatty acids in A. pohliae and perhaps other Alicyclobacillus spp. can be
used in later studies for a revision of Alicyclobacillus remains to be seen.

On the basis of the above-mentioned physiological, chemotaxonomic and phylogenetic
characteristics of strain T2T, a novel genus is proposed, Kyrpidia, as the second genus
in the parent family Alicyclobacillaceae, and a novel species is proposed, Kyrpidia
tusciae sp. nov., comb. nov. Comparative characteristics of strain T2T are given in
Table 1.

Furthermore, the phylogenetic analysis as shown in Figure 1 clearly supports the
assignment of the genus Tumebacillus to the family Alicyclobacillaceae.

Emended description of the family Alicyclobacillaceae da Costa and Rainey 2010

The description of the family Alicyclobacillaceae is given by da Costa and Rainey 2010
in [54]. Acid may be produced from carbohydrates or not. The family is comprised of the
genera Alicyclobacillus, Tumebacillus and Kyrpidia.

Description of Kyrpidia gen. nov.

Kyrpidia (Kyr.pi'di.a N.L. fem. n. Kyrpidia named in honor of Nikolaos C. Kyrpides, a
Greek-American genomics scientist, who co-initiated the Genomic Encyclopedia of Archaea
and Bacteria).

Cells are straight rods, 1.5 to 5 μm long, facultatively anaerobic, Gram-positive,
chemolithoautotrophic or chemoorganoheterotrophic. Thermoacidophilic; growth occurs
above 42°C and below 67°C, with an optimum at 55°C, and at pH 4.2-7.5. Endospores are
formed, but sspE genes for acid-soluble spore proteins are not found. The predominant
menaquinone is MK-7. Major fatty acids are iso-C15:0 and iso-C17:0; ω-alicyclic fatty
acids are not present. The G+C content of the type strain of the type species is 59.11
mol%. The type species is Kyrpidia tusciae. Kyrpidia is a member of the
Alicyclobacillaceae.

Description of Kyrpidia tusciae (Bonjour & Aragno 1984) comb. nov.

Kyrpidia tusciae (tus'ciae L. gen. n. tusciae from Tuscia; named after Tuscia, a region
in central Italy where the Etruscans (Tuscii) lived and where the organism was found).

Basonym: Bacillus tusciae Bonjour and Aragno 1984.

The genus Kyrpidia is comprised of one species, Kyrpidia tusciae. The characteristics of
the species are given in the genus description and the description given by Bonjour and
Aragno [1].

The type strain is T2T (= DSM 2912 = NBRC 15312).